Scalable Approximate Dynamic Programming Models with Applications in Air Transportation
Authors: Poornima Balakrishna
Abstract
Poornima Balakrishna, PhD
George Mason University, 2009
Dissertation Co-Director: Dr. Rajesh Ganesan
Dissertation Co-Director: Dr. Lance Sherry

The research objective of this dissertation is to develop methods that address the curse of dimensionality in approximate dynamic programming, and so enhance the scalability of these methods to large-scale problems. Many problems, including those faced in day-to-day life, involve sequential decision making in the presence of uncertainty. These problems can often be modeled as Markov decision processes using Bellman's optimality equation. Attempts to solve even reasonably complex problems through stochastic dynamic programming run into the curse of modeling and the curse of dimensionality. The curse of modeling has been addressed in the literature through reinforcement learning strategies, a strand of approximate dynamic programming (ADP). Despite considerable research effort, the curse of dimensionality, which limits the scalability of ADP in large-scale applications, remains a challenge. In this research, a value function approximation method based on the theory of diffusion wavelets is investigated to address the scalability of ADP methods.

The first contribution of this dissertation is an advance in the state of the art of stochastic dynamic programming problems solved using ADP approaches. An important intellectual merit is the diffusion wavelet based value function approximation method, which is integrated with ADP to address the curse of dimensionality. The innovation lies in this integration, which exploits the structure of the problem to achieve computational feasibility.
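For reference, the two ideas named above can be written compactly. This is a sketch under assumed notation: the discounted, reward-maximizing form, the symbols S, A(s), P, r, \gamma, w_k, and \phi_k, and the linear form of the approximation are illustrative choices, not taken from the dissertation, which may use a different formulation (for example, cost minimization). The first line is the Bellman optimality equation for a Markov decision process. The second is a generic linear value function approximation, in which the basis functions \phi_k would presumably be derived from diffusion wavelets.

```latex
% Bellman optimality equation for a discounted MDP (assumed notation)
V^{*}(s) = \max_{a \in A(s)} \sum_{s' \in S} P(s' \mid s, a)\,\bigl[\, r(s, a, s') + \gamma\, V^{*}(s') \,\bigr], \qquad s \in S

% Generic linear value function approximation with K basis functions (assumed form)
V^{*}(s) \approx \hat{V}(s; w) = \sum_{k=1}^{K} w_k\, \phi_k(s), \qquad K \ll |S|
```

Storing one value per state requires |S| numbers, which grows exponentially with the number of state variables. The approximation replaces that table with K weights, which is how an approximation of this kind targets the curse of dimensionality.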
The ADP method with diffusion wavelet based value function approximation is tested on the problem of estimating aircraft taxi-out time (the time between gate pushback and wheels-off) to establish a proof of concept for the research objective. The second contribution of this dissertation is the modeling of taxi-out time estimation as a stochastic dynamic programming problem, with the capability to provide sequential predictions in real time as the system evolves. The model aims to accurately predict the taxi-out time of a flight at least fifteen minutes before its scheduled gate pushback time. In a case study for Detroit International Airport, results indicate a 6% to 12% increase in the percentage of flights predicted accurately (with a root mean square error of two minutes) using ADP, compared with a regression model for taxi-out time prediction.

The outcomes of this dissertation research provide a generic methodology for sequential decision making under uncertainty in large-scale applications by uniting concepts from signal processing, statistics, stochastic processes, and artificial intelligence. This methodology may provide solutions for future automated decision making in large-scale complex applications in other engineering domains.

Chapter 1: INTRODUCTION

1.1 Research Objective

The research objective of the dissertation is to develop methods to address the "curse of dimensionality" in the field of approximate dynamic programming, to enhance the scalability of these methods to large-scale problems. The goals of this dissertation are:

1. To investigate a new value function approximation method based on diffusion wavelet theory to mitigate the "curse of dimensionality" in approximate dynamic programming (ADP) approaches.
2. To cast the problem of taxi-out time estimation of an aircraft as a stochastic dynamic programming model, and solve it using an ADP approach. The model aims to predict the taxi-out time of a flight at least 15 minutes in advance of its scheduled gate pushback time with a root mean square error of 2 minutes.
3. To test the diffusion wavelet based value function approximation method for ADP approaches on the taxi-out time estimation problem, and compare its performance with state-of-the-art ADP methods to establish a proof of concept for the research objective.

The scope of this dissertation research is illustrated in Figure 1.1 and is explained in the sections that follow.
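To make the accuracy target in goal 2 concrete, the following minimal Python sketch computes a root mean square error and the share of flights whose predicted taxi-out time falls within a tolerance. The function names, the sample values, and the reading of "predicted accurately" as an absolute error of at most two minutes are assumptions for illustration. This section does not define the metric precisely.

```python
import math


def rmse(predicted, actual):
    """Root mean square error between predicted and actual taxi-out times (minutes)."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def share_within_tolerance(predicted, actual, tolerance_min=2.0):
    """Fraction of flights whose absolute prediction error is at most tolerance_min.

    Assumption: this is one plausible reading of "predicted accurately".
    """
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = sum(1 for p, a in zip(predicted, actual) if abs(p - a) <= tolerance_min)
    return hits / len(predicted)


# Hypothetical taxi-out times in minutes, made up for illustration only.
predicted_times = [10.0, 12.0, 20.0, 15.0]
actual_times = [11.0, 12.0, 16.0, 15.0]

print(f"RMSE: {rmse(predicted_times, actual_times):.2f} min")
print(f"Share within 2 min: {share_within_tolerance(predicted_times, actual_times):.0%}")
```

Reporting both numbers is useful because a single large miss can push the RMSE above two minutes even when most flights are predicted within tolerance, as in the sample values here.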
Publication date: 2009